DFKI-IUPR participation in TRECVID’09 High-level Feature Extraction Task

Authors

  • Damian Borth
  • Markus Koch
  • Adrian Ulges
  • Thomas M. Breuel
Abstract

| Run No. | Run ID | Run Description | infMAP (%) |
|---|---|---|---|
| Training on TV09 data (type: A) | | | |
| 1 | IUPR-VW-TV | SIFT visual words with SVMs | 8.5 |
| 2 | IUPR-ADAPT-TV | SIFT visual words with PA1SD | 5.1 |
| Combined training on YouTube and TV09 data (type: C) | | | |
| 3 | IUPR-VW+TT-TV | SIFT visual words with SVMs, fused with TubeTagger concept detection scores | 8.3 |
| 4 | IUPR-ADAPT-YT | SIFT visual words with PA1SD, trained on YouTube, adapted to TV09 | 5.1 |
| Training on YouTube data (type: c) | | | |
| 5 | IUPR-VW-YT | SIFT visual words with SVMs | 3.2 |
| 6 | IUPR-VW+TT-YT | SIFT visual words with SVMs, fused with TubeTagger concept detection scores | 3.2 |

Similar to our TRECVID participation in 2008 [23], our main motivation in TRECVID’09 is to use web video as an alternative data source for training visual concept detectors. Web video material is publicly available in large quantities from portals like YouTube, and can form a noisy but large-scale and diverse basis for concept learning. Unfortunately, web-based concept detectors tend to be inaccurate when applied to different target domains (e.g., TRECVID data [24]). This “domain change” problem is the focus of this year’s TRECVID participation. We tackle it by introducing a linear discriminative approach in which a model is initially learned on a large dataset of YouTube video and then adapted to TRECVID data in a highly efficient on-line fashion. Results show that this cross-domain learning approach (infMAP 5.1%) (1) outperforms SVM detectors trained purely on YouTube (infMAP 3.2%), (2) performs as well as the linear discriminative approach trained directly on standard TRECVID’09 development data (infMAP 5.1%), but (3) is outperformed by an SVM trained on standard TRECVID’09 development data (infMAP 8.5%).
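The abstract does not spell out how PA1SD updates its model, so the following is only a minimal sketch of the general idea (learn a linear model on a source domain, then adapt it on-line to a target domain). It assumes the standard passive-aggressive PA-I update rule; the toy 2-D data, the `shift` parameter, and all function names are hypothetical stand-ins and are not taken from the paper.

```python
import random

def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))

def pa1_update(w, x, y, C=1.0):
    """One PA-I step on example (x, y), y in {-1, +1}; returns the new weights.

    Assumption: the standard PA-I rule, not necessarily the paper's PA1SD.
    """
    loss = max(0.0, 1.0 - y * dot(w, x))      # hinge loss
    sq_norm = dot(x, x)
    if loss == 0.0 or sq_norm == 0.0:
        return w                              # passive: margin already satisfied
    tau = min(C, loss / sq_norm)              # aggressive step, capped by C
    return [wi + tau * y * xi for wi, xi in zip(w, x)]

def train(w, data, epochs=5, C=1.0, seed=0):
    """Run on-line PA-I passes over `data`, starting from weights `w`."""
    rng = random.Random(seed)
    data = list(data)
    for _ in range(epochs):
        rng.shuffle(data)
        for x, y in data:
            w = pa1_update(w, x, y, C)
    return w

def accuracy(w, data):
    return sum(1 for x, y in data if y * dot(w, x) > 0) / len(data)

def make_domain(n, shift, seed):
    """Hypothetical toy data: 2 features plus a bias term; `shift` mimics domain change."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        y = rng.choice([-1, 1])
        x = [rng.gauss(y + shift, 0.5), rng.gauss(y, 0.5), 1.0]
        data.append((x, y))
    return data

source = make_domain(500, shift=0.0, seed=1)        # stands in for web video
target_train = make_domain(50, shift=1.5, seed=2)   # small target-domain set
target_test = make_domain(200, shift=1.5, seed=3)

w_source = train([0.0, 0.0, 0.0], source)           # learn on the source domain
w_adapted = train(list(w_source), target_train)     # continue on-line on the target

print("source-only accuracy on target:", accuracy(w_source, target_test))
print("adapted accuracy on target:", accuracy(w_adapted, target_test))
```

Because each update touches only one example and one weight vector, adaptation costs a single pass over the target data per epoch, which is the kind of efficiency an on-line scheme is meant to offer. The numbers printed by this toy example say nothing about the infMAP values reported above.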

Similar resources

Participation at TRECVID 2011 Semantic Indexing & Content-based Copy Detection Tasks

Semantic Indexing Task (SIN) Run No. Run ID Run Description infMAP (%) 1 F A IUPR-DFKI 1 Fisher Kernel + SVMs 2.86 2 F A IUPR-DFKI 2 Color Correlogram + SVMs 5.38 3 F A IUPR-DFKI 3 Fisher Kernel fused with Color Correlograms + SVMs 5.0 4 F A IUPR-DFKI 4 Fisher Kernel + kNN 0.71 Content-based Copy Detection (CCD) Run No. Run ID Run Description Opt.NDCR 1 *iupr-dfki.fsift F-SIFT+BoW+HE+EWGC 0.776...

Learning TRECVID'08 High-Level Features from YouTube

Run No. Run ID Run Description infMAP (%) training on TV08 data 1 IUPR-TV-M SIFT visual words with maximum entropy 6.1 2 IUPR-TV-MF SIFT with maximum entropy, fused with color+texture and motion (NN matching) 5.9 3 IUPR-TV-S SIFT visual words with SVMs 5.3 4 IUPR-TV-SF SIFT with SVMs, fused with color+texture and motion (NN matching) 6.3 training on YouTube data (no use of standard training set...

TZI Bremen - Trecvid 2006 high level feature extraction

In this paper, the system developed by the University of Bremen for participation in the Trecvid 2006 high-level feature extraction task is presented. Six runs were submitted, each incorporating a different combination of three classifiers based on image, sound, and text features. For the feature Corporate Leader, above-average results were achieved. Results are shown and differ...

IRIM at TRECVID 2008: High Level Feature Extraction

The IRIM group is a consortium of French teams working on Multimedia Indexing and Retrieval. This paper describes our participation in the TRECVID 2008 High Level Features detection task. We evaluated several fusion strategies, especially rank fusion. Results show that including as many low-level and intermediate features as possible is the best strategy, that SIFT features are very importan...

XJTU at TRECVID2008 High-Level Feature Extraction

In this paper, we present our experiments on the TRECVID 2008 high-level feature extraction task. This is our first year of participation in TRECVID, and our system adopts some popular approaches that other workgroups have proposed before. We propose two advanced low-level features: the NEW Gabor texture descriptor and the Compact-SIFT Codeword histogram. Our system applied the well-known LIBSVM to train t...

Publication date: 2009